A Study of the Effects of Ordering, Partitioning and Factorization Algorithms on Distributed Sparse Cholesky Factorization
نویسندگان
چکیده
In this paper, we perform a comprehensive evaluation of ordering, partitioning, and factorization algorithms under a uni ed framework. Previous research in distributed, sparse Cholesky factorization has considered each of the stages in the factorization process | ordering, partitioning and numerical factorization | in isolation. However, due to the strong dependencies between the stages, it is di cult to derive conclusions from such an approach. Speci cally, in our experiments, we use input les representative of practical problems to study the bene ts and shortcomings of ordering algorithms | multiple minimum degree and spectral nested dissection, column-based partitioning algorithms | wrapped mapping and proportional mapping, and block-based partitioning. We use, as the base for our experiments, three factorization algorithms: Fanin, Fanout and Block Fanout. In addition, we extend the research of Rothberg[19] by performing an in-depth analysis of the advantages of panel-based partitionings versus column-based partitionings on message-passing machines. All experiments were run on a Thinking Machines CM-5.
منابع مشابه
A PERFORMANCE STUDY OF SPARSE CHOLESKY FACTORIZATION ON INTEL iPSC/860
The problem of Cholesky factorization of a sparse matrix has been very well investigated on sequential machines. A number of efficient codes exist for factorizing large unstructured sparse matrices, for example, codes from Harwell Subroutine Library [4] and Sparspak [7]. However, there is a lack of such efficient codes on parallel machines in general, and distributed memory machines in particul...
متن کاملScalable Parallel Algorithms for Solving Sparse Systems of Linear Equations∗
We have developed a highly parallel sparse Cholesky factorization algorithm that substantially improves the state of the art in parallel direct solution of sparse linear systems—both in terms of scalability and overall performance. It is a well known fact that dense matrix factorization scales well and can be implemented efficiently on parallel computers. However, it had been a challenge to dev...
متن کاملEfficient Parallel Solutions of Large Sparse Spd Systems on Distributed-memory Multiprocessors
We consider several issues involved in the solution of sparse symmetric positive deenite systems by multifrontal method on distributed-memory multiprocessors. First, we present a new algorithm for computing the partial factorization of a frontal matrix on a subset of processors which signiicantly improves the performance of a distributed multifrontal algorithm previously designed. Second, new p...
متن کاملA Reordering and Mapping Algorithm for Parallel Sparse Cholesky Factorization
A judiciously chosen symmetric permutation can signiicantly reduce the amount of storage and computation for the Cholesky factorization of sparse matrices. On distributed memory machines, the issue of mapping data and computation on processors is also important. Previous research on ordering for paral-lelism has focussed on idealized measures like execution time on an unbounded number of proces...
متن کاملEffective Preconditioning through Ordering Interleaved with Incomplete Factorization
Consider the solution of a sparse linear system Ax = b when the matrix A is symmetric and positive definite. A typical iterative solver is obtained by using the method of Conjugate Gradients (CG) [15] preconditioned with an incomplete Cholesky (IC) factor L̂ [4]. The latter is an approximation to the (complete) Cholesky factor L, where A = LL . Consequently, the process of computing L̂ relies to ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996